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ABSTRACT 


Recent  vert  with  Automatic  Speech  Recognition  has 
focusea  on  applications  and  productivity  considerations  in 
the  man-rrachine  interface.  This  thesis  is  an  atterrpt  to  see 
if  placing  users  cf  such,  equipment  urder  time-induced  stress 
has  an  effect  on  their  percentage  correct  recognition  rates. 
Suojects  were  given  a  message-handling  tasK  of  fixed  length 
and  allowed  progressively  shorter  tiires  to  attempt  to 
complete  it.  Cuestionnaire  responses  indicate  stress  levels 
increased  with  decreased  time-allowance )  recognition  rates 
decreased  as  tine  was  reduced. 
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I.   INTRODUCTION 

Stress  has  long  been  an  intensely  active  area  of 
investigation.  Pore  recently,  the  intersection  of  physiology 
and  psychology  has  concerned  itself  with  the  influence  of 
cognitive  and  errotive  factors  on  quantifiable  variables  in 
the  human  organism's  equation.  This  research  has  turned  frorr 
the  most  fundamental  indices  of  human  reaction  (cardiac 
rate,  electrogal vanic  response)  to  more  subtle  measures  such 
as  various  features  of  the  human  voice. 

Stress,  however  defined,  has  clear  and  important 
implications  for  the  conduct  of  military  operations.  In  the 
warfare  environment,  one  encounters  some  of  the  few  factors 
universally  acknowledged  to  cause  psychological  stress:  it 
is  one  of  the  few  areas  where  one  knows  apriori  that  life- 
threatening  situations  are  consistently  found.  Thus  the 
psycho-physiological  correlates  of  military  action  have 
furnished  both  an  area  of  inquiry  and  the  raw  data  for  those 
interested  in  the  study  of  the  human  under  stress.  Swedish 
researchers  took  an  early  lead  in  the  investigation  of 
stress  in  the  military  environment.  [Ref.l] 

The  investigation  of  any  phenomenon  requires  first  and 
foremost  a  statement  delimiting  the  area  of  research.  Stress 
has  yet  to  receive   a  widely  accepted  definition.   Stress 
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research  tends  to  divide  inxc  twc  camps:  these  interested  in 
physiological  stress  and  those  pursuing  the  psychologic<?l 
variety.  Both  ciairr  the  territority  of  psychopbysiology . 
Beyond  this,  the  definition  of  stress  break's  down,  in  the 
rrain,  into  an  exaFination  of  either  a  stimulus  or  a 
response:  there  appears  tc  be  nc  common  ground.  Cne  thine 
which  emerges  clearly  frcrr  published  research  in  the  area  is 
that  a  synthetic  conception  of  stress  is  in  order,  one  bread 
enough  to  cover  the  wide  (and  widening)  area  of  inquiry 
already  embarked  upon,  and  which  would  permit  breakdown  into 
narrower,  more  specialized  areas  of  interest. 

Given  the  vagaries  of  the  published  resources  currently 
available,  it  is  necessary  within  the  context  of  the  present 
paper  to  work  within  an  operationally-defined,  problem- 
oriented  view  of  stress.  Ir:m  a  military  commander's  point 
of  view,  an  area  of  great  potential  interest  is  the 
interaction  of  stress  with  voice-input  to  computer  systems. 
As  miniaturization  and  expanding  technology  have  made 
computers  a  more  widespread  phenomenon  in  the  military 
environment,  the  limiting  factor  (in  the  military  just  as 
with  civilian  applications)  continues  to  be  the  human 
interface  with  the  ever-  expanding  base  of  information  and 
decision  aids  available.  Voice-input  to  computers  provides 
the  field  user  with  a  higher-order  interface,  at  a  level 
abeve  even  user-friendly  query  languages.  This  type  cf 
interface  has  the  potential  to  facilitate  the  user's  ability 
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to  access  and  manipulate  the  vast  resources  new  becoming 
available  to  hirr:  it  is  the  next  logical  step  in  the 
evolution  of  rran-rrachine  con  patabili t/.  Computers  are  now 
able  to  handle  problems  which  not  long  ago  would  have 
absorbed  an  enorrrous  amount  of  man-hours,  hut  the  man  is 
still  "in  the  loop". 
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Automatic  Speech  Recognition  (ASH)  is  a  demonstrated, 
on-line  technology  of  clear  interest  to  the  military 
commander.  (Fcr  a  discussion  of  the  range  of  voice  input 
applications,  the  reader  is  referred  to  References  [3]  and 
[4])  The  rapidity  with  which  current  data  processing  and 
computer  networks  now  operate  necessarily  imply  that  the 
machinery  spends  an  inordinate  part  of  its  time  waiting  for 
input  from  the  human  operator.  Speeding  up  this  aspect  of 
tne  interface,  a  quantum  leap  in  user-accomodat ion ,  is  one 
of  the  major  contributions  to  be  expected  (and  even  new 
available)  frorr  speech  recognizers  linking  the  human  to  the 
computer.  Speech  reccgnitcn  systems,  while  they  generally  do 
not  affect  the  internal  workings  of   the   corrputer,   present 
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their  greatest  potential  advantage  in  increasing  the 
efficiency  of  the  total  human/machine  system.  Industry  vas 
quick  to  recognize  this  potential:  voice  input  systems  are 
presently  in  use  in  various  sorting  and  assembly-line 
applications . 

Computer  input  by  voice  presents  several  advantages  in 
the  military  context,  especially  in  the  Command  Fost 
environment.  It  frees  operators'  hands  to  enable  other 
concurrent  tasks,  such  as  transcription  or  manipulation  of 
other  equipment,  increasing  productivity  in  hoth  areas. [Ref. 
2]  Just  as  importantly,  it  provides  the  potential  for  the 
voice-operator  to  maintain  a  graphic  or  figurative  plot  of  a 
situation  he  may  be  monitoring  at  the  terminal.  This  would 
allow  others  not  familiar  or  comfortable  with  the  computer 
to  comprehend  and  follow  what  the  operator  fray  see  unfolding 
at  his  worK  station,  without  requiring  lengthy  and 
distracting  explanations  from  the  operator.  Voice-input 
could  enable  the  operator  to  continue  his  task  at  the 
terminal  and  simultaneously  manipulate  a  visual 
representation  of  the  problem  he  is  involved  in,  for  others' 
benefit.  This  is  a  potential  boon  in  the  period  of 
transition  frcrr  a  symbolic  gestait  to  an  era  cf  much  more 
widespread  computer  literacy. 

Moreover,  cognitive  psychology  has  presented  a 
scientific  model  for  task  suitability  to  different 
modalities  which  takes  the  criteria  for  task  assignment   far 
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teycnd  the  cbvious  basis  ci'  "ease".  Sandry  and  Wick:  ens  [Bef. 
t:  pp. 8-10]  have  proposed  the  concept  of  "ideorotor 
compatabili ty" :  human  input/output  rrodes  rray  be  rratched  to 
the  senses  and  hemispheres  cf  the  brain  which  most  readily 
process  then.  Quicker,  rrore  accurate  responses  result  when 
responses  are  rratched  to  stimuli  (such  as  hearing  and 
speaking) .  Short  term  memory  for  verbal  information  is 
significantly  improved  when  that  information  is  presented 
auditorily . 

There  ere  cbvious  correlates  of  ideomotor  compa tabili ty 
in  a  situation  such  as  a  Command  Pest.  The  commander  gives 
an  operator  verbal  instructions  regarding  a  tastr,  and  this 
operator  then  executes  these  with  voice  input  to  a  terminal. 
In  competing  fcr  the  human  operator's  mental  resources,  it 
is  Sandry  and  Wickens'  thesis  that  no  time-sharing  decrement 
results  when  competing  tasks  are  "idecmotor  compatible". 

The  automated  resources  available  to  the  commander  in 
the  current  environment  are  expanding  and  gaining 
increasingly  widespread  acceptance.  The  applicability  of 
voice  input  to  computer  systems  is  therefore  >icxh 
intuitively  and  logically  attractive,  and  has  been  gaining 
increasing  attention.  Stress,  however  defined,  is 
universally  conceded  to  be  a  factor  in  the  human  side  cf 
this  equation,  and  therefore  to  has  implications  cf  military 
interest  in  the  command  and  control  environment . 
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Given  these  considerations,  what  then  might  re  the 
effect  of  stress  en  an  operator  of  a  voice  input  system?  In 
various  areas,  research  en  psychological  stress  abounds,  and 
researchers  are  now  turning  their  attentions  specifically  to 
the  effects  of  stress  en  the  voice.  The  published  research 
in  this  latter  area  is,  in  the  rrain,  inconclusive.  It  is  The 
intent  of  this  thesis  to  see  if  sorre  relationship  exists 
between  stress  incident  en  a  voice-input  operator  and  his 
success  rate  with  an  automatic  speech  recognizer.  It  is  to 
De  emphasized  that  virtually  no  wcrK  has  been  published  in 
this  particular  area.  The  current  study  proposes  itself  as  a 
preliminary  inquiry  into  a  phenorenon  of  potential 
significance,  and  cne  which  has  as  yet  received  little 
experimental  attention. 
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II.  V0IC5:  RECOGNITION  AND  STKISS 

A.   DEFINITION 

Central  to  any  study  of  the  interaction  cf  stress  and 
the  output  of  voice  recognizers  is  a  definition  of  what  is 
to  be  understood  cy  "stress".  As  stated  (cf.  supra),  the 
disagreement  in  defining  stress  is  so  fundamental  that  even 
recent  treatments  differ  in  classing  stress  as  stimulus  cr 
response.   In  certain  areas,  however*  consensus  emerges. 

Stress  m  the  physiological  sei.se  is  more  clear-cut  than 
the  psychological.  Physiologists  may  refer  unambiguously  to 
Durran  stressors  such  as  neat,  pressure,  electric  shock,  ana 
the  line,  in  the  way  cf  stimuli.  Th e  resultant  reactions  are 
also  referred  to  as  stress,  ana  are  treasured  in  cardio- 
pulmonary parameters,  pupil  dilation,  and  similar  observaole 
and  quantifiaoie  phenomena.  When  speaking  of  physiological 
stress,  it  suffices  to  specify  whether  stimulus  or  response 
is  meant.  Pany  stressor-agents  have  been  extensively 
studied  ooth  in  this  country  and  abroad  for  their  military 
impacts . 

Ihe  grey  area  is  entered  with  psychological  stress  and 
its  causes  or  manifestations.  Various  researchers  define 
stress  as  the  organism's  reaction  (coping  or  adaptive)  to  an 
unfriendly  environnent.  Others  aefine  stress  as  the  inimical 
influence  of  the  environment  on  the  individual.  In  recent 
usage,    the   former  definition  appears   to   have  gained 
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ascendancy  over  the  latter,  although  the  debate  is  far  from 
settled. 

It  is  therefcre  incumbent  tc  define  psychological 
"stress"  before  proceeding  further.  The  definiton  must  be 
sufficiently  fcroaa  to  span  a  wide  gamut  of  potential 
phenomena,  and  thus  preserve  meairgfulness  .  In  addition,  it 
must  be  sufficiently  narrow  to  he  useful  and  workable  within 
the  necessarily  limited  scope  of  this  experiment.  A 
synthetic  definiti:n  or  stress  will  here  mean  "the  arousal 
produced  within  the  individual  by  unwelcome  stimulus  cr 
threat".  This  is  the  operational  definition  which  will  be 
adopted  in  an  atterpt  to  examine  toe  end-effect  of  producing 
this  type  of  ar:usai  in  sucjects  interacting  with  an 
automatic  speech  recognizer. 

There  is  extensive  literature  on  the  subject  of 
psychological  stress  (much  of  which  falls  within  the 
definition  above;  and  its  correlates,  especially  the 
physiological  correlates.  The  i\awy  has  done  extensive  worn: 
in  the  field,  mcst  notably  in  Sar  Diego.  [Ref .  6] 
Psychological  stress  may  be  further  broKen  down  into 
interesting  subcategories.  1)  Situational  stress  is 
generally  held  tc  be  the  influence  on  the  individual  of 
unfavorable  environmental  (out  non-physical)  factors.  These 
factors  are  beyond  the  individual's  control  and  are 
generally  thought  to  consist  of  a  certain  set  of 
circumstances   such  as  public  speaking,  deadlines,  and  the 
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lixe.  c)  Self-induced  stress  is  generally  recognized  as  the 
self-irrposi tion  of  a  condition  or  stimulus  leading  to  the 
type  of  arousal  defined  above.  Ixarrples  include  goals, 
deadlines,  or  performance  requi  reirents  of  any  type  with 
which  the  individual  forces  himself  to  function  atove  a 
"ccmf crtacle"  cr  "easy"  level. 

These  distinctions  often  rrerge,  however.  In  a  prime 
example,  Swedish  rriiitary  pilots  were  given  a  relatively 
complex  set  of  procedures  to  execute  in  an  unsophisticated 
flight  simulator,  and  promised  a  reward  for  high 
performance.  [Ref .  ?]  The  situation  was  imposed  en  them, 
and  could  oe  construed  as  situational  stress.  The  desire  to 
perform  at  a  high  level,  however,  was  self-induced.  An 
interesting  outcome  of  the  experiment  was  that  the 
physiological  indicators  of  the  pilots  in  question  reacted 
remarkably  liue  those  of  pilots  in  a  similar  non-simulated 
situation.  A  further  interpretation  of  these  studies  was 
that  subjects  had  come  to  accept  the  reality  of  the 
simulated  situations,  and  their  physiological  indicators 
reflected  as  much.  Such  studies,  in  general,  tend  to  support 
the  validity  of  simulated  situations  in  conducting  stress 
research. 

B.   STRESS  AM  THE  VOICE 

Lespite  the  abundance  of  general  stress  literature, 
relatively  little  has  Deen  done  on  how  this  phenomenon  might 
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iranifest  itself  in  the  human  voice.  Hicks  [Ref.  8]  gives  a 
thoughtful  overview  of  the  recent  literature,  in  addition  to 
his  own  experimental  results  with  various  forms  of  stress 
ana  voice.  He  concedes  the  generally  accepted  notion  that 
stress  has  an  influence  on  individuals'  fundanental  speaking 
frequency  (lo)>  he  concludes  (as  do  others)  that  this 
parameter  is  as  likely  to  go  up  as  down,  with  concomitant 
frequency-range  variations.  The  magnitude  and  direction  of 
this  shift  in  2o  is  a  function  of  the  individual  speaker. 
Additionally ,  tne  amount  of  shift  required  tc  constitute  an 
indication  of  stress  is  still  a  sutjective  assessment.  The 
one  conclusive  result,  reported  by  hicks  and  others  [Ref.  6: 
p.9y,  p.l£4j  is  that  the  ratio  of  speaking  time  to  total 
time  will  increase  in  a  stressed  speaker.  This  is  consistent 
with  everyday  experience:  "stressed"  speakers  often  appear 
to  talx  in  longer  oursts,  with  shorter  pauses  separating 
them.  Parameters  other  than  the  speech-to-pause  ratio  give 
inconclusive  or  contested  results. 

A  large  amount  of  commercial  interest  has  focused  on  the 
influence  of  stress  on  vocai  parameters.  One  can  imagine  the 
potential  interest  in  being  able  to  evaluate  the  stress 
level  of  a  speaker.  If  done  hy  voice,  this  analysis  could  be 
performed  at  great  distances  (via  telephone,  radio,  etc.) 
and  without  the  speaker's  knowledge.  Several  firms  have 
marketed  devices  purported  to  analyze  the  stress  level  of 
speakers   (as   a   sort  of  voice  polygraph),  but  their  claims 
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appear  subject  to  delate  ana  are  hotly  contested  in  academic 
circles.  These  devices  base  their  analysis  en  the  existence 
of  microtremor  in  the  muscles  controlling  vocal  activity,  a 
phenomenon  which  rerains  to  De  showr  conclusively.  [Rei*.  9: 
p. 175] 

i\orever,  the  readout  proviaed  ty  voice  stress  analyzers 
requires  rather  subjective  interpretation.  As  noted,  vocal 
variations  so  far  demonstrated  have  shown  a  subject-specific 
nature:  where  one  subject's  1c  rises,  another's  may  fall, 
etc.  Thus,  analysis  of  a  speech  sarrjie  for  stress  requires 
some  apriori  Knowledge  of  the  individual  speaker's 
tendencies.  Analytic  efforts  so  far  have  "been  limited  by  the 
necessity  for  rather  subjective  analysis.  Work  is  currently 
underway  both  to  objectify  such  analysis  and  improve  the 
analytical  algorithms  involved. 

In  snert,  there  currently  no  universal  and  few 
conclusive  indicators  of  stress  in  a  speaker's  voice.  It  is 
widely  accepted  that  psychological  stiess  has  an  influence 
on  vocal  production,  yet  hotly  contested  in  academic  and 
industrial  circles  what  the  best  indication  cf  such  stress 
is.  In  a  military/applications  context,  it  may  well  be  that 
the  most  fruitful  area  of  inquiry  would  be  the  implications 
cf  stress  upon  the  voice,  from  the  viewpoint  of  productivity 
and  reliability  with  autonatic  speech  recogoizers. 


^"Conversations  with  Dr.  D.  Biers,  University  of  Dayton, 
October  1962. 
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III.  EXPERIMENTAL  COMPONENTS 

A.   GENERAL 

It  is  generally  recognized  to  De  important  to   simulate 

in  an  experiment  the   conditions  UDaer  which  the  phenomena  or 

interest  might  actually  be  expected  tc   occur.  This   is   nc 

less   important   in   experiments   dealing  with  voice  input, 

vhere  there  is  a  clearly  definable  task  and  one  suspects   an 

operative    out    unquantif iable    factor   such  as   stress. 
Moreover,  with  voice  input,  it  is  essential  to  use   subjects 

representative   of   those  who  might  actually  be  using  ASF.  in 
the  tasK-envircnment  of  interest. 

P.   SUBJECTS 

Ti*enty-fcur  volunteer  sutjects  were  obtained,  mainly 
from  the  C2  curriculum  at  the  Naval  Postgraduate  school.  The 
makeup    of    the    sarrpie    is    shown    below: 


Taole    1 

SUBJECT  DEMOGRAPHICS 

EXPERIENCE!    INEXPERIENCES 
-f + + 

MALE  MILI'iARY    !      y       \  8  \ 

+ + + 

5EMALE   MLITARY     j  2  j  1  | 

MALE    CIVILIAN         j  0  j  1  j 

4 4 4 

FEMALE  CIVILIAN  |      0       j      2       | 
TOTAL:  12  12 
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Twenty-one  of  these  subjects  were  military  officers,  in  the 
grade  cf  03  end  04,  representing  the  four  services;  three 
were  froir.  civilian  agencies.  Six  cf  the  twenty-four  were 
female.  These  subjects  were  at  a  ievei  in  their  careers 
where,  especially  in  light  of  their  C3  tackground,  they 
might  reascnably  expect  to  be  at  the  rria-  tc  upper  levels  cf 
a  Command-Fcst  staff.  In  short,  they  were  prime  candidates 
fcr  using  voice  input  in  a  Ccmmand-Fcst  scenario. 

Cf  the  twenty-four,  half  haa  sere  prior  experience  with 
ASfl,  which  was  defined  for  the  purposes  of  the  experiment  as 
participation  in  three  or  more  experiments  at  the 
Postgraduate  School;  the  ethers  had  none.  The  sexes  were 
evenly  divided  oetween  the  two  groups.  Batchellor  iRef.  10] 
found  no  statistically  significant  difference  in  recognition 
rates  oetween  sexes  with  the  particular  recognizer  teing 
usee.  The  taianced  sample  (clocked  on  sex)  was  used 
nonetheless  to  control  for  any  unforeseen  effects  due  tc 
genaer. 


C.   VOCABULARY 

iior  tnis  experiment,  subjects  trained  a  fifty  word 
vocaouiary  (Appendix  A)  which  included  sorre  numerals  and 
various  other  utterances.  The  vocabulary  was  chosen  with  a 
specific   purpose   in   mind.  It  has  teen  shown  IRef.ieJ  that 


correct  recognition  cf  a  word  or  phrase  is  directly  reiated 
10  the  length  of  the  utterance.  Id  this  light,  sorre  words 
v/ere  delicerateiy  chosen  over  others  to  avoid  confusion  or 
possible  rrisreccgnit  ion.  (5or  example  "send  message"  in  lieu 
of  the  shorter  "send'.)  Other  words  were  emitted  for  the 
sarre  reasons.  Some  of  the  vocabulary  iterrs  were  used 
infrequently,  tut  were  necessary  for  the  task.  In  suit;,  the 
vocabulary  chosen  was  the  minimum  felt  necessary  to  enable 
composing  short  rressages  with  a  irodicum  of  realism  and  some 
variety . 

I.   TASK 

Eacn  subject  was  given  a  scenario  consisting  of  2tJ£ 
utterarces  (Appendix  B),  with  each  wora  or  phrase  nurroered. 
These  utterances,  when  correctly  interpreted  oy  the 
recognizer,  brought  up  on  the  screen  an  output  resembling 
messages  which  might  be  sent  or  serviced  by  a  shiptoard 
Command  Post.  (The  ouput  from  these  message  sequences  is 
shown  in  the  right-hand  column  of  Appendix  B,  which  was  not 
furnished  to  experiment  participants;  for  examples  cf  the 
types  of  rressages  see  Figure  1.)  Certain  utterances  produced 
a  mul tipie-word  output,  simulating  message-handling 
utilities  such  as  are  resident  od  the  ARPANET,  with  which 
all  suDjects  were  familiar.  Verisimilitude  was  an  important 
consideration  in  the  design  of  tne  tas&: 
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SAMPLE  MESSAGE  OUTPUT 


SENr  MSG  TO:/  CV  BATTLE  GRCUF  /  COMMANDER 

INFC  CCPT  TO:  /  COMTHIRDFLEET  /  CINCPACFLEET 

EROM:  /  USS  RATHBURNE 

PCSITICN  /  REPORT  /  NUMBER  /  1 

READINESS  LVL  /  2 

PCSSIBIE  /  ELECTRONIC  /  SURFACE  /  CONTACT 

BEARING:  /  1/8/0  /  DISTANCE  /  2/7 

CONFIDENCE  LVL/  3 

REQUEST  /  INSTRUCTIONS 

QUIT./ 


FORWARD  MSG  ft:    /    7 

INFO  CCPY  TO:  /  CV  BATTLE  GROUP  /  COMMANDER 

FROM:  /  USS  RATHBURNE 

QUIT./ 


(NB:  The  symbol  /  indicates  the  separation 
between  utterances.  Line  separation  is 
accomplished  by  speaking  "RETURN"  which 
yields  a  carriage  return.) 


Figure  1 
SAMPLE  MESSAGE  CUTPUT 
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The  point  of  departure  of  tne  synthetic-work  approach  is 
a  behaviorai  analysis  cf  the  performance  requirement? 
placed  on  the  operator  by  some  specific  system,  or  by  a 
class  of  such  systems  in  general.  Tasks  are  then 
selected  against  a  criterion  of  content  validity  (i.e., 
tasks  are  selected  because  they  measure  functions  judged 
by  experts  in  the  Held  to  be  important  to  the 
operational  situation  cf  interest).  A  general  criterion 
of  face  validity  is  also  imposed  (i.e.,  tne  tasks  art 
configured  to  be  acceptable  to  target  populations,  e.g., 
pilots  ...    [Ref.ll:  pp.  22-23] 


Subjects  were  to  speak  the  utterances  in  order,  each  one 
tntil  the  proper  output  appeared  en  their  screen.  Each 
subject  v»as  given  the  same  scenario  three  times  in  the 
course  cf  tne  experiment,  with  at  least  one  day  (tut  no  more 
than  two)  tetween  reprises.  On  subsequent  trials,  subjects 
were  told  that  the  scenario  they  were  given  was  of  exactly 
the  same  length  and  of  similar  content  to  the  one(s)  they 
hac  seen  previously.  The  sane  scenario  was  used  repeatedly 
to  control  for  variability  of  the  task,  and  it  was  felt  that 
the  scenario  was  of  sufficient  length  and  variation  to 
negate  any  significant  learning  effect. 

E.   TIME  STRESS  FACTOR 

During  the  first  phase  of  the  experiment  (Phase  1), 
subjects  were  allowed  to  complete  the  scenario  at  their  own 
pace,  and  were  exhorted  not  to  rush  or  needlessly  linger. 
(cf.  Appendix  C).  The  experimenter  attempted  to  put  them  at 
ease.  Unknown  to  the  subjects,  they  v/ere  being  timed,  and 
their  times  to  complete  the  scenario  recorded.  This  time  was 
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to  serve  as  an  individual  baseline  against  whicb  subsequent 
phases  would  be  timed.  In  the  next  phases  (Phases  2  and  2} 
subjects  were  allotted  two-thirds  and  then  one-third  of 
their  baseline  times,  respectively,  tc  attempt  to  complete 
the  scenario.  It  was  anticipated  that  not  all  subjects 
■*ouid  complete  the  scenario  under  the  shortened  time 
allowance:  scoring  was  adjusted  to  reflect  the  amount  of  the 
task  actually  completed,  (cf.  IV. B  infra) 

The  time-corpressior  was  intended  to  induce  stress  in 
the  sucjects.  They  were  informed  at  the  outset  of  how  much 
time  they  would  have  to  complete  a  tasK  with  which  they  were 
already  farriiiar.  A  large  &ra-Lab  darkroom  tirrer  was  plainly 
visible  in  front  of  the  subjects  tc  shew  time  remaining. 
Additionally,  the  experimenter  rerrind^d  the  subjects 
insistently  at  thirty-second  intervals  of  their  time 
remaining.  Subjects  could  guage  their  own  prcgress  in  the 
'c/j'ti  item  scenario.  Apriori,  it  was  felt  that  the  two-thims 
time  would  constrain  some  out  not  ail  subjects;  it  was 
reasonably  certain  that  allowing  only  one-third  of  the 
baseline  time  would  do  so.  This  opinion  seeired  to  oe  borne 
out  in  pilot  trials  (with  subjects  not  used  in  the  later 
experiment ) . 

i.   EQUIPMENT  ANE  SETUP 

The  voice  recognition  system  employed  was  the  T600 
recognizer   manufactured   by  Threshold  Technology,  Inc.  (See 
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iigure  £  for  a  dIolk  diagram  or  the  system.)  Ibis  is  a 
discrete  utterance  analyzer,  which  analyzes  samples  of 
speech  up  to  a  maximum  of  two  seconds  in  length,  separated 
by  a  distinct  pause  of  at  least  one-tenth  of  a  second. 
Utterances  of  more  than  one  word  are  allowed,  as  long  as  tne 
words  are  run  together  and  pronounced  "naturally",  without 
distinct  creaKs  between  them. 

The  i6"£i6  has  a  maximum  capacity  of  ii56  utterances.  In 
the  training  rrcde,  the  subject  repeats  a  word  or  phrase  ten 
tirres.  (Under  certain  circumstances,  fewer  repetitions  rray 
oe  used,  tut  this  requires  modification  of  tne  sortware  and 
yields  less  consistent  results  [Hef.  10]/.  After  the  tenth 
repetition,  tne  ten  are  superimposed  and  normalized  in 
length  to  fcrm  a  template  for  that  utterance.  Turing  this 
experiment,  a  training  standardization  measure  was  adopted. 
When  the  subjects  had  finished  training,  they  repeated  the 
vocabulary  list  in  toto  at  least  twice;  any  items  not 
correctly  recognized  on  at  least  two  of  three  passes  were 
retrained . 

When  the  user  speaks,  his  speecn  sample  is  compared 
against  all  these  templates  for  the  best  match.  If  within 
preset  tolerances,  the  utterance  is  recognized  (correctly  or 
not)  and  the  output  string  sent  to  the  display  screen.  If  no 
natch  is  founa,  tne  machine  produces  an  audible  "beep".  Each 
individual  may  store  his  voice  template  en  a  data-tape 
cartridge  for  future  use,  and  thus  typass  the  lengthy 
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training  process  in  future  sessicns.  Each  utterance  is 
programmed  by  the  user  (in  this  case,  the  eiperimenter)  with 
a  prompt  for  training.  When  that  utterance  is  cued  at  the 
Keyboard,  the  prorrpt-string  appears  on  the  display;  it 
disappears  after  the  tentn  repetition.  After  training,  the 
recognizer  shifts  to  the  recogniton  rrcde  and  awaits  verbal 
input.  When  utterances  are  recognized,  an  associated 
preprogrammed  output  string  is  sent  to  the  display  screen. 
Tne  output  string  corresponding  to  an  utterance  is  user- 
selected,  allowing  up  to  a  maximum  of  sixteen  alphanumeric 
characters. 

When  used  in  conjunction  with  a  computer,  the  T600  rray 
be  used  in  either  direct  or  buffered  modes.  "Eirect"  sends 
the  output  string  to  tne  corrputer  without  delay.  The 
"Buffered"  mode  allows  the  user  to  verify  the  output  before 
allowing  it  to  te  sent.  The  direct  mode  was  used  here.  In 
this  experiment,  the  T€00  was  used  in  a  stand-alone 
capacity:  output  strings  were  seLt  to  the  display  screen  and 
nowhere  else,  (for  further  discussion  see  [Ref.  12] ) 

The  microphone  used  was  a  Shure  SP10  noise  cancelling 
unit.  It  was  worn  oy  all  subjects  on  the  left  side  of  the 
head,  directly  below  and  slightly  to  the  left  of  center  of 
the  lips.  Tc  further  attenuate  echc  and  outside  noise,  the 
experiment  was  conducted  inside  an  Industrial  Acoustics 
Company  soundproof  chamber.  (See  Figure  3  for  the  actual 
equipment  setup) . 


.figure  3 
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IV.   EXPERIMENTAL  ISSIGN 

A.  OBJECTIVE 

The  primary  objective  was  to  see  if  time  stress  affected 
users'  recognition  rates  with  this  voice  reccgiiiii.cn  setup. 
After  the  initial  training  session,  (and  after  a  delay  of 
three  to  1'our  days)  subjects  began  the  portion  cf  the 
experiment  which  involved  the  scenario.  They  were  fitted 
with  tne  microphone  and  handec  the  scenario.  For  each 
respective  phase,  they  were  read  the  instructions.  in 
Appendices  C,D,  and  E.  The  intention  was  to  induce  stress  in 
suDjects  in  the  iast  two  phases  cf  the  experimert  by 
progressively  shortening  the  tirre  allowed  to  perform  a  task 
of  fixed  length.  Compression  cf  time  allowed  was  to  be 
correlated  with  percent  correct  recognition  rate. 

It  was  hcped  that  several  factors  were  to  be  at  work  en 
the  participants.  There  was  a  definite  corrpetition  for  the 
subjects'  mental  resources.  Subjects  haa  to  reaa  a  wora  or 
phrase  from  their  printed  scenarios,  speak  it,  and  verify 
tnat  it  appeared  correctly  on  their  screen.  If  misrecognized 
cr  "beeped",  they  were  to  try  again  until  correctly 
recognized.  The  experimenter  instructed  subjects  to  proceed 
to  the  next  scenario  item  if  they  had  five  unsuccessful 
attempts,  although  they  had  no  foreknowledge  of  this.  This 
was   yet  another  factor  in  the  stress  equation:  failure  at  a 
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tasK  is  in  itself  stressful  for  most   individuals  .  [Hef .   13: 

p. 151] 

At  the  cutset  of  each  of  the  last  two  ( time-constrained) 

phases  of  the  experiment,  subjects  were  informed  of  how  much 

time  they  would  De  allotted  to  perform  the  scenario.   After 

the   second  and  Before  the  last  phase,,  many  indicated  to  the 

experimenter  that  they  "inew"  they  would  have  less  time  than 

in   the  previous  session.  This  suspicion  was  desired  and  was 

calculated  to  create  a  measure  of  negative  anticipation   in 

the  participants.   During   the  last   two  phases   of   the 

experiment,   subjects  were   reminded   insistently  by   the 

experimenter   of  time  remaining,  in  ac.dition  to  the  presence 

of  the  highly  visible  timer.   It  was   estimated   that   this 

combination  of  factors  would  induce  stress: 

The  apprehension  of  psychological  stressors  is  a  much 
nore  complex  business.  It  depends  on:  (l)  prior 
experience  with  a  class  of  stressors;  (2)  remembering 
the  experience;  (3)  recognizing  that  the  present 
stimulus  gestalt  is  an  instance  cf  the  earlier  class  of 
stressors;  and  >v4 )  believing  that  the  livelihood  of  the 
threatened  occurrence  is  above  some  subjective 
tbresbnola.   [Ret.  13:  p.  142J 
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been   able   tc   induce   a  reasonable  amount  of  stress  in 
their  subjects.   [Ref.  9:  p.  176J 


The  fundamental  hypothesis   being   tested   here   may   be 
sumrred  up  as: 

He:  Tirre  stress  has  no  effect  oc  the  recognition 
rates  of  users  of  voice  recognition  systems. 

HI:  Tire  stress  affects  the  recognition  rates  of 
users  of  voice  recognition  systerrs. 

Additionally,  it  vas  hoped  to  see  whether: 

He:  Experienced  and  inexperienced  users  of  voice  recognition 
systems  are  affected  equally  by  time  stress. 

HI:  Experienced  and  inexperienced  users  of  voice  recognition 
systems  are  not  affected  equally  by  time  stress. 


H.   SCOBING 

Subjects  were  monitored  during  their  performance  of  the 
scenario:  the  experimenter  sat  to  the  right  of  and  behind 
subjects  (cf.  i'igure  3)  from  which  position  both  the 
subjects'  actions  and  the  display  screen  were  clearly 
visible.  Msreccgnit  ions  (improper  word  substitutions  by  the 
recognizer)  and  non-recognitions  were  recorded.  A  confusion 
matrix  of  substitutions  and  "beeps'  is  included  as   Appendix 

3^ 


i .  Subjects  occasionally  lost  their  place  in  the  scenario, 
repeating  some  vocabulary  items  they  had  already  spoken. 
These  repetitions  were  counted  as  additional  utterances,  and 
the  correct  or  incorrect  result  recorded?  thus,  no  penalty 
was  given  for  repetition.  The  caveat  in  the  instructions 
about  penalties  for  non-completion  of  the  scenario  was 
cogus.  It  was  added  to  provide  an  additional  incentive  tc 
finish,  and  in  tne  hope  of  reinforcing  the  subjects' 
apprehension.  (Subjects  completed  an  average  of  fifty-three 
percent  of  tne  scenario  in  Phase  3J  none  finished.) 
Recognition  rates  were  based  on  the  items  cf  the  scenario 
actually  completed.  Percentage  correct  recognition  was 
figured  as  tne  quotient  of  the  number  of  correct 
recognitions  divided  Dy  the  total  number  of  utterances 
spoken  by  the  subjects  (including  mis-  and  non- 
reocgnitions ) . 

At  the  end  of  each  phase  of  the  experiment,  subjects 
were  asked  to  rate  their  feelings  during  that  phase.  (See 
questionnaire,  Appendix  G).  Five  surrogate  descriptors  were 
used  to  circumscribe  the  idea  of  "stressed";  subjects  were 
asKed  if  they  felt  : 

1.  nervous 

2.  confused 
2.  pressured 
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4.  irritated 


c.  anxious 


on  a  scale  of  one  to  five,  five  being  the  highest  level  cf 
each.  It  was  thought  tnat  these  individual  response  levels 
might  sorrehow  te  related  to  recognition  rates. 

C.  CGMCmUAI  EISIGN  CF  THI  IXPZRIfENT 

The  design  employed  was  a  two-factor  mixed  design  with 
repeated  measures  on  one  factor.  [Hef .  14:  pp.  54-61] 
Alternatively  (and  for  purposes  cf  clarity)  the  design  might 
be  thought  of  as  involving  two  conditions 
(experienced/ inexperienced )  and  three  treatments.  (See 
iigure  4  for  a  graphic  interpretation). 

As  stated  previously,  the  treatments  consisted  of  three 
variations  of  time  in  the  testing,  constituting  the  three 
phases  (excluding  the  training  phase). 


1.  Phase  1 

2.  Phase  2 

3.  Phase  3 


baseline  time:  nc  constraint 


c/Z    baseline  time 


1/3  baseline  time 


These  phases  were  conducted  in  the  order  shown.  While  it 
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EXPERIENCED 


PHASE  1 
(BASELINE) 


PHASE  2  PHASE  3 

(2/2  BASELINEMl/3  BASELINE) 


INEXPERIENCE! 


SUBJECT  1 


SUBJECT  12 


SUBJECT  13 


SUBJECT  24 


Figure  4 


CONCEPTUAL  DESIGN  OF  THE  EXPERIMENT 
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might  be  thought  preferable  in  the  classical  design  of 
experiments  to  randomize  the  administration  of  the  time 
constrained  portions,  this  was  not  done,  Randomization 
might  have  controlled  for  the  possible  effects  of  "learning" 
between  Fhases  2  and  3?  this  benefit  was  consciously 
sacrificed  in  order  to  introduce  the  apprehension  factor 
discussed  above.  It  was  suspectea  apriori  that  the  drastic 
time  compression  from  the  second  tc  the  third  phase  of  the 
experiment  would  help  to  compensate  for  the  possible 
confounding  introduced  by  "learning".  Moreover,  the  scenario 
was  felt  to  be  long  and  varied  enough  to  counteract  learning 
in  some  measure,  in  addition  to  the  experimenter's  deception 
that  successive  scenarios  were  net  exactly  the  same.  The 
fact  that  all  phases  were  separated  by  no  less  than  a  day 
•vand  no  mere  than  two)  was  also  felt  to  counteract  learning. 
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V.   RESULTS  AND  ANALYSIS 

A.   ANALYSIS 

The  results  of  the  experiment  as  described  are  shown 
graphically  in  Figure  5.  Before  conducting  any  analysis, 
Hartley's  test  for  equality  of  variance  was  performed  on  the 
raw  data.  The  results  indicated  that  equality  cf  variance 
was  an  invalid  assumption,  and  thus  the  raw  data  required 
transformation:  after  arcsln  transf orrration,  the  equality  of 
variances  was  accepted.   (See  figure  6) 

The  analysis  of  variance  table  is  presented  in  Figure  7. 
As  is  reaaily  apparent  from  the  graph  cf  the  raw  data, 
recognition  rates  for  the  experienced  group  decrease 
monotcnically  with  time  compression.  This  is  also  true  for 
Phases  c  and  3  of  the  inexperienced  group.  The  F  statistic 
shows  a  significant  difference  between  trials.  Therefore, 
the  null  hypothesis  of  time  stress  not  affecting  recognition 
rate  is  rejected. 

An  interesting  phenomenon  is  observed  with  the 
inexperienced  group,  however.  From  Phase  1  to  Phase  c  their 
success  rate  increases  noticeably,  despite  the  decreased 
tiire  allotted  for  the  tasK.  This  is  hypothesized  to  be  the 
result  of  a  combination  of  factors,  the  rrost  obvious  of 
which  is  learning.  From  their  first,  open-ended  trial  to 
the  second,  the  inexperienced  group  tecaire  more  adept  at  the 
tasK.  Moreover,  given  their  relative  naivete  with  the 
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SUMMARY  OF  EXPERIMENTAL  RESULTS 


RAW  LATA 
(For  percentages,  multiply  by  100) 


ST 


Phase  1  .959  .0033 

EXPERIENCED     Phase  2  .947  .0012 

Phase  3  .897  .0041 

Phase  l  .907  .0081 

INEXPERIENCED    Phase  2  .941  .0023 

Phase  3  .910  .0037 


for  afl-t-6  ana  df  2=(n-l  )  =  11 , 

Hartley's  test,  Frrax  =  6.32  (alpha=.25). 

lor  the  untransf erred  data: 


S^ax/Smir  =  .0061/. 0012  =  6.75 
Therefore,  reject  equality  cf  variances;  transform  data 
After  the  transf criration  (y  =  2(  arcsin  \j~x"*  )): 

Smax/Smin  =  .0428/. 0121  =  3.537 


Accept  equality  of  variances. 


Figure  6 
LATA  SUMMARY  AND  HARTLEY'S  TEST 
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SOURCE 


SS 


df 


rrs 


Total 


±.292  71 


Between 
subjects 


2.074 


Experience      .0642 


.0642     .702 


Error  (b)       2.041 


.0914 


V»ithir 
subjects 


2.216      48 


Tire  Factor 


.3242 


.1621     4.27      <.05 


TirrexEiperience   .2216     ?  .1106    2.92   ,05<p<.10 


Error  (w) 


1.672      44 


£36 


figure  7 
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equipment  setup  ana  the  task,  inexperienced  subjects  took: 
longer  to  complete  the  initial  phase  of  the  experiment  and 
thus  had  a  longer  Daseline  (an  average  of  7.6fa  minutes, 
versus  6.4S  for  the  experienced).  In  the  linear  compression 
cf  Phases  2  and  3,  they  simply  had  more  time  than  the 
experienced  group. 

It  was  not  known  apriori  if  allowing  two-thirds  of 
baseline  time  in  the  second  phase  would  impose  a  felt  tinre- 
constraint  on  all  subjects.  lor  this  reason,  one-third  was 
chosen  for  the  final  phase:  although  seemingly  impossible, 
it  was  felt  that  such  a  drastic  time  reduction  woild  induce 
stress  in  the  majority  cf  the  subjects .  Such  appears  tc  be 
the  case.  Experienced  subjects  show  a  aecline:  in  each 
successive  phase  cf  the  experiment.  Inexper iencec  subjects, 
although  theoretically  still  on  the  upward  side  of  their 
"learning  curve"  show  a  pronounced  decline  from  Phase  2  tc 
Phase  2. 

For  the  inexperienced  group,  both  the  rapidity  cf  the 
learning  effect  and  the  quickness  with  which  it  was  negated 
have  additional  significance.  A  Newman-Keuis  range  test 
'valpha  =  .05)  was  performed  for  differences  in  the  means 
amongst  phases  of  the  experiment.  IRef.  16:  pp.  35-27] 
Phases  1  and  2  were  seen  to  oe  significantly  different  from 
Phase  3,  cut  were  not  dif f erentiable  between  themselves: 
this  is  obviously  aue  to  the  inexperienced  group  starting  at 
a  lower  rate,  then  coming  up  to  ueet  the  experienced  group's 
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mean  sccre.  A  two-sample  t  -test  shows  a  difference  between 
the  experienced  era  inexperienced  groups  only  in  the  first 
phase.  (At  an  alpha  critical  of  .£69).  In  subsequent  phases 
of  the  experiment,  the  two  groups  are  statistically 
indistinguishable.  This  is  further  confirmed  by  the  F 
statistic  fcr  experience  in  the  analysis  of  variance.  The 
inference  to  be  drawn  from  this  appears  to  be  that  the 
effects  cf  learning  en  this  type  of  equipment  are  so  rapid 
as  to  maze    the  two  groups  quicKly  very  sirrilar. 


JB.   CGBEILATICN 

iirst,  an  attempt  was  mane  to  correlate  subjects' 
questionnaire  responses  with  tfl€  phase  of  the  experiment  tc 
which  they  were  responding.  It  was  desired  to  see  if 
response  levels  were  somehow  dependent  on  the  time  stress 
levels  imposed  or.  the  participants.  If  so,  a  positive 
correlation  wcuid  show  a  rising  response  level  to  one  or 
mere  cf  the  five  questions  as  time  compression  increased 
with  experiment  phase.  The  data  to  be  correlated  were 
nominal  (phase)  versus  ordinal  (1  to  5  en  questionnaires)  so 
tne  nenparametric  Spearman's  Bho  was  used.  [Ref.  17:  pp. 
j 


252-256]  Table  2  shows  the  results: 
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Table  2 
Correlation  oi'  Questions  with  Phase 


QUESTION 


EHO 


1.  nervous 

2.  confused 

3.  pressurea 

4.  irritated 

5.  anxious 

(*  indicates  significant  at  alpha  less  than  .05) 


.234 

* 

.017 

C7  *7 
•  *~  ^  •— 

V 

.e20 

.230 

* 

Thus,  three  of  the  five  4uestions  shew  a  strong  correlation 
'«ith  the  lime  compression  with  phase.  Moreover,  these  tnree 
particular  descriptors  (nervous,  pressured,  and  anxious) 
were  thought  to  be  the  closest  surrogates  of  the  five  tc 
otherwise  jescribe  the  situational  or  time  stress  desired. 
(The  otner  two  show  no  correlation.)  It  appears  that  the 
experimental  conditions  were  successful  in  inducing  stress 
in.  sore  rreasure.  Mean  response  rates  for  subjects'  answers 
are  shewn  in  figures  6-12. 

Next,  an  attempt  was  made  to  correlate  subjects' 
questionnaire  responses  (on  the  five  descriptors  already 
rentioned)  to  their  recognition  rates  with  the  system.  It 
was  suspected  beforehand  that  response  levels  tc  one  or  more 
of  tnese  questions  rright  increase  rronotonically  from  one 
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phase  lo  the  next  during  the  conduct  of  the  experiment.  It 
is  readily  apparent  that  this  was  not  the  case.  While  the 
five  surrogate  descriptors  were  deeded  capable,  in  the 
aggregate,  tc  describe  stress,  they  apparently  were  not 
completely  successful  as  a  subjective  measure. 

The  ris^  of  self-inhibition  is  always  run  when  asking 
subjects  tc  rate  their  own  perceptions,  emotions,  etc.  [Ret*. 
18:  pp.  501-502]  Self-image  or  self-esteem  may  prohibit  a 
respondent  from  admitting  of  a  higher  level  of  what  he  may 
feel  to  be  an  "undesirable"  feeling  cr  trait.  This  may  have 
been  the  case  here.  In  the  course  of  the  eiperiment,  the 
experimenter's  subjective  assessment  of  the  subjects' 
emotional  states  did  not  agree  well  with  their  questionnaire 
responses.  Cn  several  occasions,  subjects  left  the 
experiment  giving  every  appearance  of  being  agitated  and 
exercised,  naving  answered  with  a  "l"  (i.e.  not  at  all)  to 
all  five  questions . 

Moreover,  although  instructed  to  respond  only  in  regard 
to  each  individual  phase  of  the  experiment,  it  appears 
lively  that  subjects'  responses  may  have  been  in  the  context 
cf  their  previous  experiences  in  the  experiment.  Thus  one 
sees  the  rise  in  most  responses  from  the  first  (untimed)  to 
the  second  phase  of  the  experiment.  Turing  the  final  phase, 
subjects  were  expecting  to  be  placed  under  a  tighter  time 
constraint  (the  desired  apprehension  effect).  This  may  in 
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part  explain  the  absence  of  a  rise  in  response  levels  for 
this  last  phase. 

The  data  gathered  from  the  questionnaires  was  in  the 
ordinal  (1  to  5)  scale.  Since  an  atteirpt  was  to  be  made  to 
correlate  these  responses  with  recognition  rates  (interval 
aata)  recourse  was  again  made  tc  the  r.on-parametric 
Spearman's  Rho  procedure  for  correlating  the  ranks  of  both 
sets  of  data.  Recognition  rates  were  tested  against 
responses  to  all  five  questions  for  each  phase  of  the 
experiment.  Results  generally  shoved  do  correlation. 

One  question  alone  provided  a  rather  consistent  negative 
correlation  with  recognition  rates.  For  the  ineiperienced 
subjects,  question  £  ("confused")  showed  a  strong  negative 
correlation  (rho<  -0.4)  for  all  three  phases  of  the 
experiment  (credible  at  an  alpha  less  than  or  equal  tc  0.1). 
This  would  imply  that  as  confusion  level  rose,  recognition 
rate  went  down.  It  is  interesting  tc  note  that  there  was  no 
correlation  found  "between  this  variable  and  recognition  for 
the  experienced  group.  The  only  other  question  which  shewed 
sorre  (negative)  correlation  was  the  fourth  ("irritated"), 
but  this  relation  was  at  most  tenuous  ( e.25>alpha>0 . 1 ) .  Also 
of  note  is  the  fact  that,  after  the  initial  pnase  of  the 
experiment,  response  levels  for  the  two  groups  parallel  one 
another,  in  some  instances  very  closely.  This  would  appear 
further  to  corroDorate  the  hypothesis  that  differences 
oetween  the  two  groups  diminish  rapidly. 
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VI.   REMARKS 

If  psychological  stress  has  an  effect  on  the  voice,  -is 
is  generally  acknowledged,  it  is  reasonable  to  expect  it  to 
have  an  effect  on  the  success  rates  of  users  of  voice  input 
equipment.  As  discussed  previously,  ASR  technology  is  of 
potentially  great  interest  to  the  rrilitary  planner  in  the 
Command,  Control,  and  Communications  context.  This 
preliminary  inquiry  has  attempted  tc  shew  a  correlation 
between  stress  ana  recognition  success  with  such  equipnent. 
Several  areas  of  consideration  suggest  themselves. 

As  has  been  seen,  recogniton  rates  declined  when  the 
operators  of  this  ASR  equipment  were  placed  under  tire 
stress.  An  alternative  approach  to  evaluating  the 
desiraoiiity  cf  this  technology  might  be  to  assess  vcice 
versus  other  means  of  input  under  sirrilar  conditions.  Tn 
this  study,  subjects  attempting  to  complete  a  nearxy 
impossible  task  (Phase  3)  nonetheless  managed  to  maintain  a 
correct  recognition  rate  cf  approximately  ninety  percent. 
With  manual  input  (e.g.,  typing),  each  characer  to  be  input 
represents  an  opportunity  for  error:  how  this  mignt  be 
affected  ty  psychological  stress  is  open  to  speculation, 
when  one  considers  that  a  sixteen  character  output  string 
can  be  produced  with  one  voice  cemmand,  it  may  well  be  that 
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voice  input  becomes  even  more  desirable  Id  a  stressful 
environment . 

It  is  interesting  to  note  how  quickly  the  inexperienced 
subjects  'caught  up"  with  and  paralleled  the  performance  of 
their  experienced  counterparts.  If  this  is  indeed  a 
generalizable  jhenomenon ,  it  would  inply  that  after  some  few 
training  sessions  with  a  recognizer,  th?  distinction 
vanishes.  If  so,  faced  with  a  potentially  adverse  situation 
a  commander  or  watch  officer  need  not  worry  about  this 
experience-level  in  his  Command  Post  staffing.  Voice  input 
has  also  been  proposed  for  automated  reservation  and 
directory-assistance  schemes.  Given  the  success  rates  for 
inexperienced  users  even  under  conditions  calculated  to 
induce  stress,  these  applications  appear  no  less  feasible. 

Lastly,  it  is  generally  acknowledged  that  a  more 
objective  method  for  assessing  psychological  stress  is 
desperately  needed,  Any  further  work  in  this  area  should 
find  an  alternative  and  hopefully  more  workable  method  of 
assessing  stress  in  subjects.  It  is  submitted  that  some 
relatively  non-intrusive  physiological  measures  of  stress  be 
considered.  Several  of  these  observable  and  quantifiable 
indicators  are  well  documented  in  existing  stress 
literature . 
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APPENDIX  A 


^VOCABULARY  LIST* 


WORD 


WORD 


0.  ZERO 

25. 

UNKNOWN 

1.  ONE 

26. 

POSSIBLE 

2.  TWO 

27. 

CERTAIN 

2.  THREE 

28. 

CONFIDENCE 

4.  FOUR 

29. 

DAMAGE 

5.  FIVE 

20. 

ASSESSMENT 

6.  SIX 

21. 

REQUEST 

?.  SEVEN 

22. 

INSTRUCTIONS 

6.  EIGHT 

READINESS 

9.  NEGATIVE 

24. 

INTENT 

10.  TERMINATE 

25. 

ATTACK 

11  .  SEND  MISSAGI 

26  . 

REPORT 

12.  ANSWER  MESSiGE 

27. 

RATHBURNE 

12.  FORWARD  MESSAGE 

28. 

FRCM 

14.  CINCPACTLIET 

29. 

COMMANDER 

15.  THIRD FLEET 

40. 

SURFACE 

16.  BATTLE  GROUP 

41. 

TIME 

IV.  INFORMATION 

42. 

ZUIU 

IS.  RETURN 

42. 

PERIOD 

ly.  RADAR 

44. 

POSITION 

20.  VISUAL 

45. 

NUr'BER 

21.  ELECTRONIC 

46. 

BEARING 

22.  CONTACT 

47. 

DISTANCE 

22.  SIGHTING 

48. 

UNDER 

24.  SOVIET 

49. 

ADVISE 

APPENDIX  B 


•SCENARIO  LISTING* 


CRT  OUTPUT 


1.  SEND  MESSAGE 

2.  BATTLE  GROUP 

3.  COMMANDER 

4.  RETURN 

5.  INFORMATION 

6.  TEIRDFLEET 

7.  CINCPACFLEET 

8.  RETURN 

9.  FROM 

10.  RATHBURNE 

11.  RETURN 

12.  POSITON 

13.  REPORT 

14.  NUMBER 

15.  ONE 

16.  RETURN 

17.  READINESS 

18.  THREE 

19.  RETURN 

20.  POSSIBLE 

21.  ELECTRONIC 


SEND  MSG  TO:  CV  BATTLE  GROUP  COMMANDER 

INFO  COPY  TO:  COMTHIRDFIIET  CINCPACFLEET 

FROM:  USS  RATHBURNE 

POSITION  REPORT  NUMBER  1 

READINESS  LVL  3 

POSSIBLE  ELECTRONIC  SURFACE  CONTACT 

BEARING:18tf  DISTANCE  27 

CONFIDENCE  LVL  3 

REQUEST  INSTRUCTIONS 

QUIT. 


22.  SURFACE 

23.  CONTACT 

24.  RETURN 

25.  BEARING 

26.  ONE 

27.  EIGHT 

28.  ZERO 

2y.  DISTANCE 

30.  TWO 

31.  SEVEN 

32.  RETURN 

33.  CONFIDENCE 

34.  THREE 

35.  RETURN 

36.  REQUEST 

37.  INSTRUCTIONS 

38.  TERMINATE 


39.  FORWARD  MESSAGE 

40.  SEVEN 

41.  RETURN 

42.  INFORMATION 

FORWARD  MSG  #:7 

43.  BATTLE  GROUP      INFO  COPY  TO:  CV  BATTLE  GROUP  COMMANDER 

FROM:  USS  RA1BBURNE 

44.  COMMANDER 

45.  RETURN 


CR 


QUIT. 


46.  FROM 

47.  RATHBURNE 

48.  TERMINATE 


49.  ANSWER  MESSAGE 

50.  TWO 

51.  FOUR 
b2.  RETURN 

53.  CONTACT 

54.  UNKNOWN 

55.  POSSIBLE 
te.  SOVIET 

57.  PERICE 

58.  INTENT 

59.  UNKNOWN 

60 .  TERMINATE 


ANSWER  MSG  #:  24 

CONTACT  UNKNOWN  POSSIBLE  SOVIET, 
(no  carriage  return  in  scenario) 
INTENT  UNKNOWN 

QUIT. 


61.  SEND  MESSAGE 

62.  BATTLE  GROUP 

63.  COMMANDER 

64.  RETURN 

65.  INFORMATION 

66.  THRIEHEET 

67.  CINCPACiLEET 

68.  RETURN 
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69.  CONTACT 

70.  SIGHTING 

71.  REPORT 

72.  NUMBER 

73.  SIX 

74.  RETURN 
7£.  TIME 

76.  ONE 

77.  SEVEN 

78.  EOUR 

79.  EIGHT 
£0.  ZULU 
ei.  RETURN 
£2.  POSITION 
83.  ONE 

£4.  JIVE 

£5.  SEVEN 

£6.  TWO 

£7.  ZERO 

£8.  RETURN 

£9.  COMMANDER 

90.  ASSESSMENT 

91.  POSSIBLE 

92.  SOVIET 

93.  SURFACE 


SEND  MSG  TO:  CV  BATTLE  GROUP  COMMANDER 

INFO  COPT  TO:  CCMTHIRDFLEET  CINCPACFLEET 

CONTACT  SIGHTING  REFORT  NUMBER  6 

TIME  1748  ZULU 

POSITION  15720 

COMMANDER  ASSESSMENT  POSSIBLE  SOVIET  SURFACE. 

(no  carriage  returr  in  scenario) 

UNKNOWN 

QUIT. 
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94.  PEF.IOE 

95.  UNKNOWN 

96.  TERMINATE 


97.  ANSWER  MESSAGE 

98.  THRU 

99.  EIGHT 

100.  RETURN 

101.  INFORMATION 

102.  CINCFACF1FET 
102.  RETURN 

104.  FROM 

105.  RATHEURNE 

106.  RETURN 

107.  UNKNOWN 
106.  SURFACE 
109.  CONTACT 
lie.  PERIOD 

111.  POSSIE1E 

112.  SOVIET 
112.  RETURN 

114.  UNKNOWN 

115.  RADAR 

116.  CONTACT 

117.  BEARING 
US.  TWO 


ANSWER  M3G  #:  26 

INFO    COPY    TO:    CINCPACUEET 

FROM:    DSJi    RATH  EUR  ME 

UNKNOWN  SURFACE  CONTACT.  POSSIBLE  SOVIET 

UNKNOWN  RADAR  CONTACT  BEARING:  235 

DISTANCE  14 

NEGATIVE  VISUAL  CONTACT. 

CONFIDENCE  LVL  ASSESSMENT  6. 

INTENT  UNKNOWN.  REQUEST  ADVISE 

OUIT. 
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119.  THRU 

12 e.  FIVE 

121.  RETURN 

122.  DISTANCE 
122.  ONI 

124.  FOUR 

125.  RETURN 

126.  NEGATIVE 

127.  VISUAL 

128.  CONTACT 

129.  PERIOD 

130.  RETURN 

131.  CONFIDENCE 

132.  ASSESSMENT 

133.  SIX 

134.  PERIOD 

135.  RETURN 

136.  INTENT 

137.  UNKNOWN 
136.  PERIOD 

139.  REQUEST 

140.  ADVISE 

141.  TERMINATE 


142.  SEN!  MESSAGE 
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143. 

BATTII  GROUP 

144. 

COMMANDER 

145. 

RETURN 

146. 

INFORMATION 

147. 

THIRIFLEET 

14£. 

RETURN 

149. 

FROM 

150. 

RATHBURNE 

161. 

RETURN 

152. 

CONTACT 

153. 

REPORT 

154. 

NUMBER 

1  C^K 

X  w  w   • 

SIX 

156. 

RETURN 

157. 

CONTACT 

15g. 

SIGHTING 

159. 

RETURN 

i6e. 

CONTACT 

161. 

CERTAIN 

162. 

SOVIET 

163. 

SURFACE 

164. 

RETURN 

165. 

EEAPING 

166. 

TWO 

167. 

THREE 

SENE  MSG  TO:  CV  BATTII  GROUP  COMMANDER 

INFO  COPY  TO:  COKTBIREFLEET 

FROM:  USS  RATHBURNE 

CONTACT  HEFCRT  NUMBER  6 

CONTACT  SIGHTING 

CONTACT  CERTAIN  SOVIET  SURFACE 

BEARING:  237 

EISTANCE  14 

INTENT  UNKNOWN 

QUIT. 


6kJ 


166.  SEVEN 

16b.  RETURN 

17K.  DISTANCE 

171.  ONE 

172.  FOUR 
172.  RETURN 

174.  INTENT 

175.  UNKNOWN 

176.  TERMINATE 


QUIT. 


177.  SENT  MESSAGE 

176.  THIRDFLEET         SEND  MSG  TC:  CCrTHIRDFLEET 

INFO  COPT  TC:  CINCPACiLEET 

17b.  RETURN  FROM:  USS  RATHBURNE 

COMMANDER  ASSESSMENT  CV  BATTLE  GROUP 

182.  INECRMATICN        (no  carriage  return  in  scenario) 

UNDER  SOVI5T  ATTACK.  DAMAGE  LVL 

181.  CINCPACFLEET       (no  carriage  return  in  scenario) 

UNKNOWN.  REQUEST  ADVISE  USS  RATHBURNE 

182.  RETURN  (no  carriage  return  in  scenario) 

INTENT 
162.  FROM 

184.  RATHBURNE 

185.  RETURN 
166.  COMMANDER 
187.  ASSESSMENT 

186.  BATTLE  GROUP 
iey.  UNDER 
19e.  SOVIET 
lyi.  ATTACK 
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192.  PERIOD 

192.  DAMAGE 

194.  UNKNOWN 

195.  PERIOD 

196.  REQUEST 

197.  ADVISE 
196.  RATHEURNE 
19S.  INTENT 
24?e.  TERMINATE 


END        SCENARIO 

********************* 
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APPINIIX  C 


INSTRUCTIONS  TC  EXPERIMENT  PARTICIPANTS 


*PHASE  1* 


The  experiment  you  will  be  participating  in  simulates 
what  might  happen  in  a  shipboard,  command  post.  You  will  be 
using  the  vocabulary  you  have  already  Trained  during  the 
first  session. 

You  will  oe  given  a  list  or  wcrds  to  speak  to  the 
machine:  when  recognized,  they  wiii  come;  up  on  your  screen. 
Certain  words  produce  a  multiple  word  output,  as  you  have 
already  seen.  These  are  an  attempt  to  simulate  message- 
handling  utilities  such  as  are  found  on  the  ARPANET. 

Speak  clearly  and  distinctly,  tut  don't  overenunciate. 
Kc  provision  has  been  made  to  let  ycu  back  up  and  correct 
mistakes,  misreccgnitions,  etc.,  so  j-ou  rust  continue  to  say 
the  word  until  it  is  recognized.  If  you  rriss  a  word  and 
don't  catch  it,  I  will  prorrpt  you  for  the  word  you  missed? 
please  simply  try  again  on  the  were.  I  prompt  you  with. 
Pauses  between  wcrds  may  be  short,  but  they  must  be 
distinct . 

Set  your  own  pace, don't  rush  but  don't  linger.  You  are 
not  competing  against  anyone  (including  yourself)  for  time 
or  recognition,  so  don't  be  nervous.  Mo  grades  are  being 
given  out  here.  Once  you  start,  however,  please  continue  to 
the  end . 

I  will  be  glad  to  answer  any  questions  you  may  have 
about  the  conduct  of  tnis  phase  of  the  experiment.  Any 
questions  atout  the  purpose  and  outcome  I  will  be  glad  to 
discuss  at  threat  length  once  the  actual  experimentation 
phase  is  through. 

Thanks  again  for  your  help. 
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appeneix  r 

INSTRUCTIONS  TO  EXPERIMENT  PARTICIPANTS  '"PEASE  2* 


This  is  Phase  2  of  the  experiment.  In  this  phase  you 
will  participate  in  a  Command  Pest  scenario  very  similar  in 
content  ana  length  to  the  first  scenario  you  were  given. 

Tne  ircst  sign  if  leant  difference  in  this  phase  is  the 
time  factor.  In  Phase  2  you  will  be  allotted  only  2/3  of  the 
tirre  you  originally  too*  to  complete  the  entire  scenario.  In 
iact,  you  ray  not  be  able  to  complete  the  entire  scenario. 
It  is  to  ce  emphasized  that  your  primary  objective  is  tc 
corrplete  as  much  of  the  scenario  as  you  possibly  can,  as 
accurately  as  possible.  In  this  command  post  situation,  the 
messages  ycu  will  be  handling  are  cf  an  urgent  nature,  and 
your  priirary  rrotivaticn  is  to  get  as  rrar.y  of  then  completed 
as  possible.  Ycu  are  b  =  ing  scored  fcr  both  recognition  rate 
and  the  percentage  of  the  scenario  ycu  complete.  Any  parts 
you  ac  not  cor-  plete  will  count  against  ycur  overall  score, 
unfortunately .... 

During  tne  experiment,  I  will  give  you  frequent 
reminders  of  the  time  you  nave  remaining  at  30  second 
intervals.  The  tirre  I  announce  will  be  in  minutes  and 
seconds  remaining.  The  timer  will  also  furnish  ycu  with  a 
running  indication  of  time  remaining. 

Again,  attenpt  to  speak  clearly  and  distinctly.  Nc 
provision  has  been  made  to  back  up  and  correct  mistakes  or 
misrecogni tiens ,  sc  you  must  repeat  the  word  or  phrase  until 
it  is  correctly  recognized.  If  you  aiss  a  word  and  don't 
catch  it,  I  will  prompt  you  for  the  vcrd  ycu  rrissed:  please 
try  again  on  tne  word  I  prompt  you  with. 

Let  re  re-enphasize  here  that  the  shortened  time- 
aiicwance  is  meant  to  represent  the  urgency  of  the  situation 
in  the  Command  Post,  and  that  your  principal  goal  is  to 
finish  as  rruch  of  the  scencirio  as  possible. 

Thanks  again  for  ycur  help. 
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APPENDIX  I 
INSTRUCTIONS  TC  EXPERIMENT  PARTICIPANTS  *PH»SE  3* 


This  is  Phase  3  of  the  experiment.  In  this  phase  7013 
will  participate  in  a  Comn-and  Post  scenario  very  similar  in 
content  and  length  tc  the  first  scenario  yen  were  given. 

The  rrost  significant  difference  in  this  chase  is  the 
tirre  factor.  In  Phase  3  you  will  be  allotted  only  1/3  of  the 
time  yen  originally  took  tc  complete  the  entire  scenario.  In 
fact,  yon  may  net  be  able  to  complete  the  entire  scenario. 
It  is  to  te  errpnasized  that  ycur  priirary  objective  is  to 
ccmplete  as  much  of  the  scenario  as  yen  pcsslcly  can,  a? 
accurately  as  possible.  In  this  command  post  situation,  the 
passages  you  will  he  handling  are  cf  an  urgent  nature,  =  r_d 
ycur  primary  motivation  is  to  get  as  many  cf  them  completed 
as  possible.  You  are  being  scored  for  both  recognition  rate 
aod  the  percentage  cf  the  scenario  yon  complete.  Any  parts 
yen  dc  net  ccm-  plete  will  cennt  against  ycur  overall  sccre, 
unfortunately.  . .  . 

luring  the  experiment,  I  will  give  you  frequent 
reminders  cf  the  time  you  have  remaining  at  3?  second 
intervals.  The  tirre  I  announce  will  be  in  minutes  and 
seconds  reraining.  The  timer  will  also  furnish  you  with  a 
running  indication  of  time  remaining. 

Again,  attempt  tc  speak  clearly  and  distinctly.  Mo 
provision  has  been  made  to  "back  up  and  correct  mistakes  or 
misreccgciticns ,  sc  you  must  repeat  the  wcrd  or  phrase  until 
it  is  correctly  recognized.  If  yon  miss  a  word  and  don't 
catch  it,  I  will  prorpt  yon  for  the  wcrd  you  rrissed:  please 
try  again  on  the  wcrd  I  prompt  yen  with. 

Let  rre  re-emphasize  here  that  the  shortened  time- 
allowance  is  reant  to  represent  the  urgency  of  tne  situation 
in  the  Ccrrmard  Pest,  and  that  ycur  principal  ?cal  is  to 
finish  as  rruch  of  the  scenario  as  possible. 

Thanks  again  for  your  help. 
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APPENDIX  G 


QUESTIONNAIRE 

SUBJECT  NUMBER 


DATE 


PEASE 


Please  answer  the  following  questions  concerning  your 
feelings  during  the  phase  of  the  experiment  you  have  just 
completed.  These  questions  pertain  ONLY  to  this  phase. 

Circle  the  response  wbich  best  describes  your  feelings. 
The  scale  is: 


1  =  NOT  AT  ALL 
3  =  M0DERAT2IY  OR  SOMEWBAT 
5  =  VERY  MUCH  SC 
Responses  2   and  4  are  the  midpciEts  between  tbe  above. 


QUESTION:   (Circle  cne  for  each  question) 

During  this  phase  of  tbe  experiment,  I  felt: 


net  at  all 


nervous 

confused 

pressured 

irritated 

anxious 


c 

C 

2 


moderately 
3 
3 
2 
3 
3 


4 
4 
4 
4 
4 


very  ^uch  sc 


5 


^"^PLEASE!  Don't  discuss  your  experience  or  answers  with  anyone. 
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